A CONSTRUCTION OF A /3-COALESCENT VIA THE PRUNING OF 

BINARY TREES 



ROMAIN ABRAHAM AND JEAN-FRANgOIS DELMAS 

Abstract. Considering a random binary tree with n labelled leaves, we use a pruning 
procedure on this tree in order to construct a /3(§, i)-coalescent process. We also use the 
continuous analogue of this construction, i.e. a pruning procedure on Aldous's continuum 
random tree, to construct a continuous state space process that has the same structure as 
the /3-coalescent process up to some time change. These two constructions enable us to 
obtain results on the coalescent process such as the asymptotics on the number of coalescent 
events or the law of the blocks involved in the last coalescent event. 



1. Introduction 

Let A be a finite measure on [0, 1]. A A-coalescent (H(t),t > 0) is a Markov process which 
takes values in the set of partitions of N* = {1, 2, . . .} introduced in |25j . It is defined via the 
transition rates of its restriction (Tv- n i(t),t > 0) to the n first integers: if n"[ n ](i) is composed 
of b blocks, then k (2 < k < b) fixed blocks coalesce at rate: 

(1) \,k= C u k -\l-u) b - k k{du). 







In particular a coalescent event arrives at rate: 

P) = 

k=2 V 7 



6,fc- 



As examples of A-coalescents, let us cite Kingman's coalescent (A(dx) = So(dx), see [23]). 
the Bolthausen-Sznitman coalescent (A(dx) = l(o,i) (x)dx, see [133), or /3-coalescents (A(dx) 
is the (3(2 — a, a) distribution with < a < 2, see [I2j [10], or the (3(2 — a, a — 1) distribution 
with 1 < a < 2, see [E]). We refer to the survey [9] for further results on coalescent processes. 

The goal of this paper is to give a new representation for the /3(3/2, l/2)-coalescent using 
the pruning of random binary trees. This kind of idea has already been used in [18j where the 
Bolthausen-Sznitman coalescent is constructed via the cutting of a random recursive tree. 

1.1. Pruning of binary trees. Now we describe the coalescent associated with the pruning 
of the random binary tree. 

We recall the normalization constant in the beta distribution for a > 0, b > 0: 

IM) = r( " )r( g = / x a -\l-x) h - x dx. 



T(a + b) 



(0,1) 



Fix an integer n and consider a uniform random ordered binary tree with n leaves (2n — 1 
vertices plus a root; In — 1 edges). Label these leaves from 1 to n uniformly at random. After 
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an exponential time of parameter: 
(3) A n 



('<- 1) {-y" ~ 



choose one of the n — 1 inner vertices of the tree uniformly at random, coalesce all the leaves 
of the subtree attached at the chosen node and remove that subtree from the original tree. 
Restart then the process with the resulting tree (using A& as the new parameter of time, 
where k is the new number of leaves) until all the leaves coalesce into a single one. 
Figure [T] gives an example of such a coalescence for n = 5. 




[2.4 J 




[1,24,5] 




[1,2,3,4,5] 



Figure 1. An example of the coalescence of a binary tree for n = 5. The 
selected node is in bold and the times between each picture are exponentially 
distributed with respective parameter A5, A4, A2. 



This defines a process (n [n] (t),t > 0). The main result of the paper is the following 
Theorem. 

Theorem 1.1. The process (H^ n \t),t > 0) defined as the coalescent associated with the 
pruning of the random binary tree is the restriction to {1, . . . ,n} of a /3(3/2, 1/2) -coalescent 
with coalescent measure: 



(4) 



A(du) 



l-u 



du. 



Let us remark that the /3-coalescents introduced in |12j and usually studied are /3(2 — a, a)- 
coalescents with 1 < a < 2. Here we have a = 1/2 and this does not enter the usual case. 
The reason is that for 1 < a < 2, the /3-coalescent comes down from infinity (n(i) has a.s. 
a finite number of blocks for any t > 0) which is not the case for a = 1/2 according to the 
criterion from [9] Theorem 3.5 or |25| as: 

- 1 A(du) 



< +00. 



u 
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1.2. Pruning of Aldous's CRT. A coalescent process may also be viewed as a process 
(I(t),t > 0) taking values in the interval partitions of [0,1]. The length of each interval 
represents the mass of a block, and the process represents blocks (whose sizes sum to 1) that 
merge together as time passes. We can go from this interval partition-valued process to the 
previous A-coalescent framework by the classical paintbox procedure (see for instance [26]): 
consider (t^)jgN a sequence of i.i.d. uniform random variables on [0, 1], independent of the 
process I and, for every t > 0, say that the integers i and j belong to the same block in H(t) 
if and only if J7j and Uj belong to the same interval in I(t). 

The continuous analogue of the binary tree is Aldous's continuum random tree (CRT) 
which can be obtained as the limit (in an appropriate sense) of the rescaled uniform binary 
tree with n leaves when the number of leaves n tends to infinity. A pruning theory of such 
a continuum tree has been introduced in [5] (see [2] for a general theory of the pruning of 
Levy trees) and will be recalled in Section 13.11 Using this pruning procedure, we are able to 
define an interval-partition- valued process in Section 13.21 which has the same structure as the 
/3(3/2, l/2)-coalescent except for the times (when sampling n points uniformly distributed on 
(0, 1), the time interval between two coalescences is not exponentially distributed). However, 
we conjecture that an appropriate change of time would be enough so that the interval- 
partition- valued process is really associated with the /3(3/2, l/2)-coalescent via the paintbox 
procedure. This gives a nice interpretation of the dust (or fraction of singletons) in the 
/3(3/2,l/2)-coalescent. 

1.3. Number of coalescent events and last coalescent event. The construction using 
discrete trees allows us to recover in Section 14.11 the asymptotic distribution of the number 
of coalescent events given by [19] in a more general framework, see also [21] . 

Proposition 1.2. Let X' n be the number of collisions undergone by (IV- n \t),t > 0). Then we 
have: 

±% V2Z, 

where Z has a Rayleigh distribution with density xe~ x2 ^ 2 1i x> q\. 

Let us remark that according to Section 5.4 in [19] Z is distributed as: 

o poo 

4= / dt e"^ 2 , 
V 71 " Jo 

where (St,t > 0) is a subordinator with E[exp(— XSt)] = exp(— t$(A)) where for A > 0: 

#(A) - jf (l - (1 - »)*) = 1 1 (l - (1 - „)») «■*"(! - uY^iu = 

(See [11] for more results on exponentials of Levy processes.) 

The continuum tree construction allows us to study the last coalescent event (see [18] for 
similar questions for the Bolthausen-Sznitman coalescent, and p3] for stationary CSBP). In 
Section 14.21 we consider the number E n of external branches or singletons involved in the last 
coalescent event as well as the number of blocks B n involved in the last coalescent event. 

Proposition 1.3. Let B n be the number of blocks and E n be the number of singletons involved 
in the last coalescent event of (H\ n \t) , t > 0). Then we have: 

{B n ,E n ) — > (B,E), 
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where (B—E, E) are finite random variables with generating function $ given, for p, € [0, 1] 
by: 

<5>(p,p*)=E[p B - E pf] 

(5) - , (l + , g(2 ) - ,o g (! + V^7. ~ *±*)) - (i^f^|) • 

Furthermore B — E is stochastically less than ( or equal to) E + 1 . 

Notice that $(1, 1) = 1 which indeed implies that £? and £7 are finite, that is £>„, and £7 n 
are of order 1. However, B — E and E have infinite expectation, as this can be checked from 
their generating functions given below. 

The generating function of E is given by: 

E[>f]=*(l,*) = l-21og ( 1 + ^T^)> 
and the generating function of B — E is given by: 

E [p B ~ E ] = <D(p, 1) = p (1 + log(4) - log (1 - p)) - Vplog (y^/=) ' 

There is a nice interpretation of the distribution of £> — £7 — 1 given after Proposition 14.21 
The generating function of B is given by: 

E [p B ] = d>(p, p) = p (l + log(2) - log (l + - p)) - Vplog Q±^2±^ . 

Of course we have a.s. B > 2. 

Remark 1.4. We can compute various quantities related to £7 and B. We have: 
P(£7 = 0) = l-21ogQj -0.19, P(£7=l) = l P(£7 = 2) = i 

P(£? - £7 = 0) = 0, P(B - £7 = 1) = log(4) - 1 ~ 0.39, ¥{B - £7 = 2) = -, 

3 

5 23 
P(£? = 0) = P(B = 1) = 0, P(£? = 2) = -, P(£7 = 3) = — • 

In particular, we have P(£7 > 5) < 25%, P(£? > 5) < 32% and P(£? - £7 > 5) < 11%. 

The paper is organized as follows. The proof of Theorem 1 1.1 1 is given in Section [2j The link 
with Aldous' CRT, presented in Section [3TT1 is given in Section [3721 using a pruning procedure; 
the reduced sub-trees are presented in Section 13.31 A proof and a comment on Proposition 
11.21 are given in Section f4.il Proposition 11.31 is proved in Sections 14.2.21 and l5l 

2. The /3(3/2, 1/2)-coalescent 

In order to prove Theorem 11.11 we first need to compute the rates at which k given 
blocks among n blocks coalesce. This is the purpose of the next Proposition. 

Proposition 2.1. For the coalescent of the random binary tree, we have for any 2 < k < n, 
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We recall the duplication formula for a > 0: 

i\ ^ r(2o) 



2 j 2 2 "- 1 r(a) ' 

Proof. Let us first remark that, by construction, since there are n — 1 internal vertices, we 
have: 

A„ ( \ \\ r- r(n- §) 



An,n = — = /5(n--,-)=Vvr 

m th 

Catalan numbers 



n - 1 V 2 ' 2 7 r ( n ) 
It is well known that the number of ordered binary trees with n leaves is given by the 



l{2n-2\_ (2n - 2)! 

On. — 



n \ n — 1 J (n — l)!n! 
Hence the number of ordered binary trees with n labelled leaves is: 

(6) Cn = „ !6 „ = M = ^!r ( „-i). 

Consider a binary tree with n labelled leaves. Let 2 < k < n. Fix k labels, say the k first. 
For these labels to coalesce at the same time, the leaves with these k labels must exactly lie 
in a single subtree of the initial tree. Therefore, to construct such a tree, we must choose an 
ordered binary tree with k leaves labelled from 1 to A; (Ck possibilities), choose an ordered 
binary tree with n — k + 1 leaves labelled from k to n (C n _k+i possibilities) and graft the 
tree with k leaves at the leaf labelled k. Then, for the k first labels to coalesce, the chosen 
branch must be the branch that links the two subtrees (and each branch is cut at rate \ n ,n)- 
Therefore, we have: 

_ CkCn-k+1 > 
^n,k — 7^ ^n,n 

2 2fc-2 / 1 X 2 2n-2fe / ]\ ^ 1 r(n-i) 
T (k--\ —r(n-k+-\-^—-—, rr-s/Sr- 



□ 



v 7 ^ V 2 J V* V 2 J 22n ~ 2 r (n - 1) r(n) 



i[ k-^,n-k+~ 



This proves that the process evolves like a A-coalescent with A given by (j4]) up to the time 
of the first merger. To finish the proof of Theorem II. 1\ it remains to prove that, after that 
first merger, the resulting tree is still a uniform binary tree with uniform labeled leaves. 

Let us fix k < n and let be a tree with k labelled leaves, one being labelled by the block 
[ii, . . . , i n -k+i], the others being labelled by singletons. We want to compute the probability 
of obtaining that tree after the first merger. From this tree, we construct a tree with n leaves 
by grafting, on the leaf of labelled by the block [ii, . . . , i n _k + i\, a tree with n — k + 1 leaves 
labelled by {h, ■ ■ ■ ,i n -k+i}- There are exactly C n _k+i different trees (this corresponds to 
the choice of the grafted tree). Moreover, the tree T/% is obtained after the first merger if the 
original tree is one of those, and if the chosen internal node is the leaf labelled by the block. 
Hence, the probability of obtaining is 

1 Cn-fc+l 

n - 1 C n 
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We note that this probability depends only of the number k of leaves of the tree and not 
on the tree itself, hence conditionally on merging n — k + 1 leaves, the resulting tree is still 
uniform among all the trees with k leaves. 

3. Links with the pruning of Aldous's CRT 

3.1. Aldous's CRT. In [BJ, Aldous introduced a continuum random tree (CRT) which can be 
obtained as the scaling limit of critical Galton- Watson trees when the length of the branches 
tends to 0. This tree can also be seen as a compact (with respect to the Gromov-Hausdorff 
topology) real tree. Indeed, a real tree is a metric space (T, d) satisfying the following two 
properties for every x,y G T: 

• (unique geodesic) There is a unique isometric map f x>y from [0, d(x, y)\ into T such 
that f x>y (0) = x and f x , y (d(x,y)) = y. 

• (no loop) If ip is a continuous injective map from [0, 1] into T such that </?(0) = x and 
<p(l) = y, then 

y([0 1 l])=/*,»([0,d(ar,y)]). 

A rooted real tree is a real tree with a distinguished vertex denoted and called the root. 

It is well known that every compact real tree can be coded by a continuous function (this 
coding is described below), Aldous's tree is just the real tree coded by a normalized Brownian 
excursion e. We refer to [24] for more details on real trees and their coding by continuous 
functions. 

Remark 3.1. In fact Aldous' CRT is coded by 2e. We omit here the factor 2 for convenience 
but some constants may vary between this paper and Aldous' results. Our setting corresponds 
to the branching mechanism ifi(\) = 2A 2 in [15] . 

Let e be a normalized Brownian excursion on [0, 1]. For s,t £ [0, 1] we set: 

d(s, t) = e(s) + e(i) - 2 inf e{u). 

u&[sAt,sVt] 

We then define the equivalence relation s ~ t iff d(s, t) = and the tree T is the quotient 
space T = [0, 1]/ ~. We denote by p the canonical projection from [0, 1] to T ■ The distance 
d induces a distance on T and we keep notation d for this distance. The metric space (T, d) 
is then a real tree. The metric space (T, d) can be seen as a rooted tree by choosing = p(0) 
as the root. 

For x,y G T, we denote by [x, y\ the range of the unique injective continuous path between 
x and y in T ■ We also define a length measure £(dx) on the skeleton of T (i.e. non-leaves 
vertices) by 

l({x,y})=d(x,y). 

Finally, for x,y G T, we denote by a(x, y) their last common ancestor i.e. the unique point 
in T such that: 

[0,x]n[0,y] = [0,a(x,y)]. 
For simplicity, we write for s,t G [0, 1], a(s,t) for a(p(s) , p(t)) . 

3.2. The interval-partition- valued process. As in [5j [T| , we throw points on the CRT 
"uniformly" on the skeleton of the CRT and add more and more points as time goes by. 
More precisely, we consider a Poisson point measure M(d0, dx) on [0, +oo) x T with intensity 
Ad6i{dx). 
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For 9 > 0, we define an equivalence relation ~g on [0, 1] by: 

J s = t or 
S ~ et ^ \M([0,8] x [0,a(M)D >0. 

We set (if, A; G /Cg) the equivalence classes associated with ~# non-reduced to a singleton. 
Let us remark that each it is an interval. 
Equivalently, we define 

T e = {x€T, M([0,e] x [0,x]) =0} 

which is the set of vertices that have no marks on their linage. The tree Te is called the 
pruned tree; it corresponds to the whole dust of the coalescent process. Then consider the 
set {T®,k S Kg) of the connected components oiT\Te which are the sub-trees that are 
grafted on the leaves of Te to get T (see Figure [2]). Then if is just the set of s 6 [0, 1] such 
that p(s) e T%. 




Figure 2. Left: Aldous's CRT with the marks. Right: the sub-trees con- 
structed from the marks 



By the definition of the mark process, for 6' > 6 we have 

S ~0 t S <~~>Ql t 

and consequently 

Vk e Kg, 3k' e K ff , 4 c 4'- 
Therefore, the process X = N j| , k € Kg), 9 > 0^ can be viewed as a process where several 

blocks coalesce together (with part of the dust) into a single larger block. On the picture on 
trees, when 9 increases, the number of marks also increases and when a mark appears on Te, 
some sub-trees above Te coalesce with part of Te- 

Let us remark that, as announced in the introduction, this process always has dust which 
corresponds to individuals that have no marks on their lineage i.e. that belong to Te ■ The 
dust has Lebesgue measure ag: 

&e= l{Af([o,0]x[0,p(s)])=o} ds = / 1 {p(s)eT e }ds. 
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We recall the distribution of (erg, 9 > 0) from [8] on the size process of a tagged fragment for 
a self-similar fragmentation, see also [2] Proposition 9.1 or Corollary 9.2 (but with (3 = 2). 
The distribution of (ag , 9 > 0) under the normalized Brownian excursion measure is given by 
(1/(1 + 4:Tg),9 > 0) where (tq,0 > 0) is a stable subordinator with index 1/2 with no drift, 
no killing, and Levy measure (27rx 3 )~ 1 / 2 dx on (0, oo): for A > 0, E[e~ Are ] = e~ e "^. 

3.3. The reduced tree with n leaves. We apply the paintbox procedure to this process. 
Consider n independent (and independent of the process) random variables, uniformly dis- 
tributed on (0, 1). This corresponds to choosing n leaves uniformly on Aldous's CRT. Then 
the law of the reduced tree, T n , containing these n leaves is given in [15], Section 3.3, or [7j, 
Section 4.3. The shape of the reduced tree is a binary tree with uniform probability on all 
ordered binary tree with n leaves. As the tree is binary, it is composed of 2n — 1 branches 
with lengths (hi, . . . , hm-i) and distribution: 



7~ _ i)\ Sn e 2S " 1 {'n>o,-,/i2n-i>o} ■ ■ ■ dh 2n -i, 



where s n = Y^—i^ ^* is the total length of the reduced tree. 

Since the reduced tree is binary and all its edges are identically distributed, the first 
mark that appears on the reduced tree is uniformly distributed among all the edges and we 
deduce that this continuous coalescent procedure has the same structure as a (3 (3/2, 1/2)- 
coalescent process. However this process is not stricto sensu a coalescent process as the time 
at which n leaves chosen at random undergo a coalescence is not exponentially distributed, 
but is distributed according to an exponential random variable with (random) parameter 
H n , with H n equal to 4 times the total length of the internal branches length. Thus H n is 
distributed as 4^^~J h^. Notice the random variables (hi, ■ ■ ■ , h2 n -i) are exchangeable and 
E[hi] = 2- 3 / 2 T(n - l/2)/r(n). In particular, we have: 

r(n-|) [2 



E[H n ] = 4(n - 1)EM = V2(n - 1) " = \ -X n . 

1 (nj V 7T 

So E[iT n ] corresponds (up to a scaling constant) to the rate of the /3(3/2, l/2)-coalescent 
starting with n individuals. 

Remark 3.2. We conjecture there exists a random time change (At,t > 0) such that the 
process process (^(I^ f ,k £ K,A t ),t > 0^ is the interval-partition- valued process associated 
with the /3-coalescent. However, we were not able to exhibit such a time change. 

4. Applications 

4.1. Number of coalescent events. Proposition II .21 is just a consequence of Theorem 6.2 
of |22j on the number of cuts used to isolate the root in a Galton- Watson tree with a given 
number of leaves. We must just remark that a binary tree with n leaves has 2n — 1 edges and 
that a Galton- Watson tree with binary branching conditioned to have n leaves is uniformly 
distributed among the binary trees with n leaves. 

Remark 4.1. According to [1], we also have the following equality in distribution: 

Z = 9 with 6 = f a e d9. 



o 



Notice that if a pruning mark appears twice or more on the same internal branch, only one 
will be taken into account as a coalescent event, and that the pruning marks which appear 
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on the external branch will not be taken into account as a coalescent event. Let X n be the 
number of pruning events of the reduced tree with n leaves and X' n the number of coalescent 
events. We deduce that X n is stochastically larger than (or equal to) X' n . But the a.s. 
convergence which appears in [1] (see also [20] for the fluctuations) gives that a.s.: 

lim 4^ = ^26. 



This implies that the number of marked external branches, say W n , is of order o(n). We shall 
see in Section [4.2.21 that it is in fact of order 1. 

4.2. Last coalescent event. 

4.2.1. The CRT framework. We refer to |15| for the construction of Levy trees and their 
main properties. Let T be a continuum Levy tree associated with the branching mechanism 
■0(A) = a\ 2 , i.e. coded as in Section 13.11 by a positive excursion e of \f2oiB where B is a 
standard Brownian motion. We denote by N the "law" of this tree when the coded function 
e is distributed according to the Ito measure (hence N is an infinite measure and is not really 
a distribution) and by N^ r ^ the same law when e is distributed as a normalized excursion of 
length r. We denote by m the mass measure on the tree, i.e. the image of the Lebesgue 
measure on [0, +oo) by the canonical projection p. 

Conditionally on T, let M be defined as in Section ^. 2l with the intensity 2a d6 i{ds] instead 
of 4d9£(ds). (We introduce the parameter a in order to make the references to [2] easier.) 
Consider the pruning of the tree T at time 9 > 0: 

T e = {seT; M({0,6] x [0, S J) = 0}. 

And we set o~g = m(Tg). This notation is consistent with the definition of ag in Section [3.21 
Using Theorem 1.1 of [4] (see also Proposition 5.4 of [2j), we get that under N, the pruned 
tree Tg is distributed as a Levy tree associated with the branching mechanism ipg defined by: 

il>B(u) = il>(u + 0)-il>(0). 

Moreover, using Lemma 3.8 of [2], we have the following Girsanov formula that links the law 
of Tg with that of T: for every nonnegative measurable functional F on the space of trees, 
we have: 

(7) n[F{Te)] =N [F{T)e- ae2 

Let n be a positive integer. We consider under N (or N^ r ^) conditionally given the tree 
T, n leaves xi,...,x n i.i.d., uniformly chosen on the set of leaves, i.e. sampled with the 
probability m(dx)/a. For 9 > 0, let (T 3 ,j S J) be the connected components of T\Te- We 
write Y^n) = Yli=i ^-{x e eT e } the number of chosen leaves on the subtree Tg, and for k > 1, 
Yg(n) = Card {j £ J',YH=i ^-{x^Ti} = ^} the number of subtrees with exactly k chosen 
leaves. In particular, we have: 

(8) Y e \n) + Y,kY e k {n)=n. 

We set Ng(n) = ^2 k>0 Yg(n) the number of chosen leaves on Tg plus the number of subtrees 
with chosen leaves. For convenience, we shall consider: 

(9) Yg(n) = ^ Y e(n) = Ng{n) - Y e °(n) - Y e \n). 

k>2 
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Let T n be the reduced tree of the chosen leaves: that is the smallest connected component 
of T containing the root and {xi, 1 < i < n}. Let 7^ n be the reduced tree T n pruned at 
time 9 > 0: 

V = {ser ; M([o,0]x[o, s ]) = o}. 

Notice that Ng is the number of leaves of T g n (with the convention that the root is not a leaf). 
Define the last pruning event as: 

L n = \ni{9 > 0; N e {n) = 1}. 

We define: 

(10) U n = N Ln _(n), V n = Yl n _{n) + Yl n _{n), and W n = Yl n _(n). 

We can interpret U n as the number of leaves of the pruned reduced tree, V n — W n as the 
number of chosen leaves of the pruned reduced tree and W n as the number of subtrees with 
only one chosen leaf just before the last pruning event. 

4.2.2. Proof of Proposition \1.3X Let B n be the number of blocks and E n be the number of 
singletons involved in the last coalescent event of (n\ n \t),t > 0). Using the link with the 
pruning of CRT from the previous Section, we have that (B n ,E n ) is distributed as (U n , V n ) 
under N^. 

Following Remark 14.11 we can see Vn as the sum of V n — W n (number of leaves of T n with 
no mark before the last pruning event) and the number W n of leaves of T n with no mark on 
their ancestral lineage until the mark corresponding to the last coalescent pruning but for 
the external branch, where there is at least one mark. 

Before giving the asymptotic distribution of (U n , V n ), we need to introduce some notations. 

For a > 0, b > such that a + b > we define Ao(a, b) as: 



(11) A (a,6) = 4 



- 1 lo A + ^+v^TTO if 1 + 6 _ 2a/0 , 

y/\l + b-2a\ \l + y/b-y/\l + b-2a\J 
2 

if 1 + b - 2a = 0. 



+ 

It is easy to check that the function Ao is continuous in (a, b) and that: 

(12) lim A (a, b) + \og(Vb + a) = log(2). 

(a,b)->(0,0) 

We set for a > and b > (with the convention 1(0, 0) = 1): 

(13) I{a, b) = l + log(2) - \og(Vb + a) - A (a, b). 

Notice that the function / is continuous on [0, +oo) 2 (and in particular at (0,0)). 
We shall prove the next result in Section [5j 

Proposition 4.2. Under N^, as n goes to infinity, (U n ,V n ,V n ) converges, in distribution, 
to a finite random variable (U, V,W). And the distribution of (U,V,W) is characterized by 
the following generating function, for p, po,Pi £ [0, 1] : 

\ tc r U-V V-W w~\ rh P + Pl -, \ 
*(p,Po,Pi) = IE \p p p 1 J=p/ll — ,1-pol- 

Notice that the random variable U — V — 1 is distributed as W. Since W < V, this implies 
that U — V —1 is stochastically smaller than V. This last remark and Proposition 14.21 readily 
implies Proposition 11.31 
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The generating function of V — W is given, for pq G [0, 1] by E 



1(0,1 -po), that 



is: 



E 



1 + log(2) - log y/l-po 



1 



:l0g 



i + vr^7o + V^po 



V 2 - Po V 1 + V 1 - Po - V 2 - PoJ ' 

The generating function of W is given by, for pi £ [0, 1] by: 



E[p\ 



/ ( o) = 1 + log(4) - log (1 - Pl ) - log ( 

2 ) \fP\ \l-y/pi 



5. Proof of Proposition 14.21 

In order to compute the generating function that appears in Proposition 14.21 it is easier to 
work first under N (and then condition on a to have the result under and to consider a 
Poissonian number of chosen leaves. Let A > 0. Under N or N^ r ^, conditionally given the tree 
T, we consider a Poisson point measure J\f = ^ on T with intensity Am. We denote 
by N = N{T) the number of chosen leaves. The law of the total mass a = m(7~) of T under 
N is given by the following Laplace transform: 



(14) 



N[l - e~ Ka ] = V _1 ( K ) = 



Q. 



Conditionally on a, the random variable N is Poisson with parameter Act. Therefore, by first 
conditioning on a, we get for k > 1: 



(15) 



\ k 

MN = k} = — N 
k\ 



1 



a r(fe-i) 

20F V a T(k + 1) 



Let 6 > 0. From the special Markov property, Theorem 5.6 in [2] (see also |3j), we get that 
under N, conditionally on a$, the random variables (Yq(N), k > 0) are independent, Yg(N) is 
Poisson with parameter Xag and for k > 1, Yq(N) is Poisson with parameter 2a9aQN{N = k]. 

For a G [0, 1], we set: 



fo(a) =N a '/> 
Lemma 5.1. VFe /iaue; 

(16) f (a) = e + ^-y/6 o + 26 1 VT 



L {iV>0} 



a — doa, 



with 

5 = i 
Notice that: 

(17) 



> 2 + - + 20j-(l- P ), dx = epJ-, y 

a \ a \ a 



2 — — Po 

a 



(P~ Pi)- 



30 



2 + -(i - po) + -(2 - p - Pl ) > e 2 > o. 



A 

a V a 

Consequently the right hand side in (fT6|) is well defined. 
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Proof. We set fi = — log(p) and ^ = — log(/9j) for z 6 {0, 1} and k = — log(a). We have: 
Ma) 



N 



-li Y°(N)- tJ , 1 YMN)-ij,Y g (N)-KN 



N 



N 



L {iV=0} 

-( W+K )y e 1 (iv) 



n4 



-(^+fc K )y fl fe (A>) 



fc>2 



-N[iV = 0], 



using Equation (jSJ) and the independence of the variables Yg(N) conditionally given ag. Now, 
as the variables Yg(N) are conditionally given oq Poisson variables, and thanks to (|15j) . we 
have: 

fe(a) = N 



e -7^e _ e -AcT 



with 

7 = A(l - e _(/t+W)) ) + 2a6>AN 



ere 



-Act 



d_ e -(K+m)) + 2a#^^-N ^e- Aa (1-e 



k>2 



A(l - e - (K+w) ) + 2a6>AN [<re-H (1-e 



-((C+ftl)> 



2a9 f N 



1-e" 



-Act 



(1-e- 



AN 



cr e 



1-e 



-(fl+K.) 



) + e^N 



1-e 



-A(l-e- K )o- 



We now use Formula ([T4"]) to get 



7 = A(l - e -( re+ ^°)) + 0v / A^e- K (e-' 1 - e"^ 1 ) + 20>/Aa(l - e"' 1 ) + 26 e - " VAaVl - e" K 
= 20VAa(l - e - ^) + A + 20 e - ' 1 v^v 7 !^ - a f A e^ -0\ZA^(e _/1 - e"^ 1 )) . 



Using the special Markov property of |2J, Theorem 5.6, we get that, conditionally given 
o~q, a is distributed as o~q + ^ <jj where the oVs are the atoms of a Poisson point measure of 
intensity 2aQooH\da\. This yields: 



Ma) = N 

To conclude, we use Girsanov Formula ((7J) to get: 

/e(a) = N 



e -7^s _ e -(Te(A+26»v / A^) 



e -(7+a6» 2 )cr _ e -o-(a6» 2 +A+26»yAa) 



+ \ - 

a 



e 2 + ^. 

a 



This gives the result. 
Let us set 



□ 



-n{u n -v n )-no(y n -w n )-mW n 



A n = N« 

To prove Proposition 14.21 it is enough to prove that: 

(18) lim A n = pl(l-P±^,l-p ). 

As the CRT is coded by a Brownian excursion, it enjoys a scaling property, namely the law 
of rT under N^ r > is those of T under N*- 1 ) (where rT means that we multiply the distance on 
the tree by a factor r). Consequently, the mark process (defined as a Poisson point measure 
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with intensity proportional to the length measure) also satisfies a scaling property. It is then 
easy to deduce that the law of (U n , V n , W n ) doesn't depend on a. So, we have: 



A, = N« 
for every positive r. We set: 
(19) 

By conditioning on a, we get: 
A n F 

For n > 1, we set: 
(20) F e (ra,r) = N (r 
Recall: 



- fJ, ( U„ - V n ) - ( V n - Wn ) - M 1 W n 



A n = A n n 



a n e" AfT 



P Po Pi 



and H n (0) = N F g (n, a)a n e 



n „— Act 



(21) N 
Therefore, we have: 



a n e~ Xa 



A" n n!N 



N = n 



1 



-Tin 



fe(a)=N[a n F e (N,a)l {R>0} 
and, thanks to (|2ip . for n > 1: 

/^ n) (0) = n!N \F e (n,a)l w 



L {N=n} 



\ n H n (8). 



We use the description of [15] of the reduced tree spanned by n leaves under the cr-finite 
measure N: it is a uniform binary tree with n leaves and with edge lengths i.i.d. and 
"distributed" as adh. We denote by [0,£i] the edge of the reduced tree attached to the 
root, and by H its length. The time L n at which the last coalescent event occurs is just the 
first time at which a mark appears on [0, x±J and therefore it is, conditionally given H = h, 
exponentially distributed with parameter 2ah. Moreover, if we denote by and the 
two sub-trees attached to x\, and respectively their total mass, we have: 

ff = o-W + o-W + 5>, 

iel' 

where the <Tj's are the total mass of the sub-trees attached on the edge [0,xi]. The random 
measure ^2iel' Ui ^ s i n dependent of and is, conditionally on {H = h}, distributed 

as a Poisson point measure with intensity 2ahN[da]. Eventually, the two reduced sub-trees 
attached to x\ are independent and distributed as uniform binary trees with respectively k 
and n — k leaves (0 < k < n). Recall that 2aKN [l — e _ACT ] = 2\/a\h. We deduce from this 
description that: 



Ajt, 



a e 



-2Va\h 



dh 



J™2ahe- 2ahd de Y,(^)jH k {e)H^ k {e) 



(22) 



de 



{e + ^x/^y 



G n {9), 
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with 

. n n—1 / \ .. n— 1 /• \ 

(23) G B ((?) 4E U ^M) = \T. ( J/f(0)/i n_fc) (0)- 

fc=i ^ ^ fe=i ^ ' 

Let Co = # + yA/a and g = cq — fg, so that: 

5(a) = \J 5o + 25\yJl — a — 62a. 

If /i is a function, we denote d™ =0 h(a) for /^(O). Then, using the formula for the n-th 
derivative of a product of functions, we have for n > 2: 



1 n ~ 1 / \ 

°rM = 2 E U > W (0)5 (n - fc) (0) = j 5« =0 5 2 (a) - 5 (0) 5 (n) (0). 
fe=i ^ ' 

That is, since 



r(n-i) 



(24) G n (0) = fc^oVT^ - V<^o + 2(5! «^(0) = -«i^-=^ - A /^+2^ (n) (0) . 

2V7T 

The next Lemma gives an equivalent expression for (^(O). 
Lemma 5.2. VFe /iaue: 

</W(0) 0! 1 



lim — 



Proof. Using that the density, which corresponds to the density of the 1/2-stable subordinator 
with no drift: 



■ / \ °l r 1 -S 2 r 2 Ir - 

h{x) = v^^ e {x>0} 



has Laplace transform: 

f +00 

e~ Ax h(x) dx = e" 



/ 

J 



we can write: 



ofal - — !— f°° —(l- e -(So+25 1 VT=E-8 2 a)A 

= J_ r5 ir ^ e -^ 2 /^l_ e -5or-, + a( 52 r + ,)\ 

27r J r d / 2 J a^/ 2 V / 

We deduce, with y = r/x that for n > 1: 

2vr y Jo Vy \ y 

_ ftr(n) f°° dy_fl + 5 2 y\ n 
2vr 7 ^ V 



e -^(<5o+^ +5? 2/) 



ftr(n ) f 00 

2vr 



1/ / (<*o + | +<%) n 
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with 

/ n l + fcy 

1 + 8 y + 8{y l 

To get an equivalent expression for gW(0), we use Laplace's method. Notice that <p(0) = 1 
and <p'(0) = 8 2 - 8 < thanks to (jnj). Notice that o~ > and 81 > 0. 

The only root to <p(y) = 1 is 0. Let us assume there is a root to <p(y) = — 1. This implies 
that 1 + S y + <5 2 y 2 + 1 + <5 2 y = that is: 

(<5 + 8 2 )y + ,5 2 y 2 = -2. 

But we have: 

80 + 8 2 > 9 2 + - - eV\a > 0. 
a 

Therefore, |<p(y)| G (—1,1) for y > 0. Since, limy_> +00 </?(y) = 0. 
The discussion above proves that: 

38 > 0, V<5 G (0, J ), Vx > 5, |v?(x)| < p(<?) < 1. 

Moreover, we have: 

lim log(^ = L 

Therefore we can apply the standard Laplace's method and get the following asymptotic in 



11 : 



aM(O) ~ r ^ = 6lT{n) r(1/2) ~ ^^-h) 1 



where a n ~ 6 n means \\m. n ^ +00 a n /b n = 1. □ 
Recall the notations of Proposition I4.2t 



*(p> Po, Pi) = pi ( 1 - P \ Pl , 1 - Po 



Lemma 5.3. H^e /ioue, /or p,po,pi G [0, 1]: 



lim N (1) 



P Po Pi 



*(p> Po,Pi) 



Notice that ^(1,1,1) = 1(0,0) = 1. This implies that (U n ,V n ,W n ) converge in distribu- 
tion, as n goes to infinity, to an a.s. finite random variable (U, V, W) and that the generating 
function of (U — V, V — W, W) is given by \£. 

Proof. On the one hand, we deduce from (I24p and Lemma 15.21 that: 

On the other hand, we deduce from ([20]) (by considering p = po = Pi = 1) and ([2~Tj) that: 



(25) F n (0) < N 



a n e~ Xa 



Recall ([6]). By decomposing an ordered binary tree with n labelled leaves in two ordered 
binary sub-trees attached to closest node of the root, we get: 

n-l 



^ ] ( , j CkC n —k = Cn- 
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This readily implies: 

n-l 

£ 



k=l 



i/l/,-±)r(»-*-i 



We deduce from the first equality of (|23l) and (l25j) that: 

n-l 



A 



2aV7r 



r n- - 



This implies that: 



r (n - i) " 2a0? 
By dominated convergence, we deduce from (|22p that: 

d6 G„(0) 



lim -; 

n^+oo r [n — |) 



lim 



2y / Q!'/T 

Va 



ede 



a 



00 0d0 



+ 



\\/e 2 + e^(2-p-pi) + ±a-po) J 
o + i \ 



2v^ Jo (# + l) 2 \^ 2 + ^(2-/>-Pi) + (1-Po) 
We deduce from (fTU|) that: 



lim ^4 r , 

n— >+oo 



2v/a7r , AM„ 
lim 



\/X n^+co r (n - |) 
00 / 



+ 1 



/o (# + l) 2 \ vV /^ + ^2_p_ /0l ) + (i_ po ) 

Then, we use Lemma 15.51 below to conclude. 

Before stating and proving Lemma 15.51 we first gi ye a preliminary result. 

Lemma 5.4. For a > ; 6 > and a + b > 0, we have A(a, 6) = Ao(a, 6). 

Proof. We first assume 1 + b — 2a > 0. From (|27|) . we have: 

d# 



1 



□ 



-d a A(a,b) 



(8 + l)(9 2 + 2a9 + 6) 3 /2 
-2d b I(a, b) 



A(a,b) 



+ 



1 



1 + 6- 2a 1 + 6 - 2a ^6 + a 

Then, by computing the derivative d a Ao, we deduce that A(a, b) = Ao(a, b) + hb(a), for a 
function hb solving: 
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h b (a) 



c+(b) 



VI + 6- 2a 

a. (6) 



Similarly, we have that for 1 + b — 2a < 0, 

A(a,6) = A (a,6) , _ _. 

V |1 + o - 2a | 

for some function c_. Notice that A and Ao are by definition (|27p and (jlip continuous on 
(0, +oo) 2 . By letting a goes to (1 + b)/2, we deduce that c+ = c_ = 0. This proves the 
result. □ 

Recall the definition of I given in (|13|) . We set for a > 0, 6 > 0: 



o + 1 \VW+2aJ+b + 1 
Lemma 5.5. For a > 0, b > 0, we have J (a, b) = I {a, b). 

Proof. We first notice that: 

alO 1 



(26) 



f (0 2 + 2a0 + 6) 3 / 2 V6 + a 
For a > 0, b > and a + b > 0, we set: 



and 



d0 



1 



(27) 

We have: 



A(o,6) = 



o (e 2 + 2a6 + 6) 3 / 2 ^(v^ + a) 



-dbJ(a, 6) = - 



o (# + 1)702 + 200 + 6' 
>d0 



l){6 2 + 2a0 + 6) 3 /2 
d0 



2(1 + 6 - 2a) 7o V0 2 + 2a0 + 6 
A(a,6) 1 



6 + b 



1 



"2(1 + 6- 2a) + 2(1 + 6- 2a) 



e 2 + 2a6 + b 6 + 1 
1 6 



(28) 

We also have: 

-d a J(a,b) 



A(a,6) 



"2(1 + 6- 2a) + 2(1 + 6- 2a) ^6 + a 

r°° e 2 de 



Vb + a Vb{Vb + 
Vb+1 



< + l){9 2 + 2a9 + 6)3/2 



1 



d0 



1 + 6 - 2a 7 V0 2 + 2a9 + b 



(6 - 2a)6 -6 1 



02 + 2a# + 6 + 1 



A (a, 6) I 



1 + 6- 2a 1 + 6- 2a 



6 -2a 



Vb + a Vb(Vb + a) 



(29) 



A(a,6) 



l + 6-2a + l + 6-2a Vb + a 
After computing d a A(a,b), we deduce from (p9|) that, for a + b > 0: 

J(a, 6) = - A(a, 6) - log (y/b + a) + g(b), 



1 b-Vb-2a 



18 



ROMAIN ABRAHAM AND JEAN-FRANQOIS DELMAS 



for some function g. Then computing dbJ(a,b), we get using (|28|) that g(b) is a constant c. 
Eventually, on the one hand taking a = \fb = 1, we get: 



+ 1 9 + 1 



0. 



On the other hand, we have: 

J(l, 1) = -A(l, 1) - log (2) + c = -1 - log(2) + c. 
This gives c = 1 + log(2). We get that: 

J(a, b) = -A(a, b) - log (yi + aj + 1 + log(2). 

That is / = J for a + b > 0. Then, we use the continuity of / and J to get I = J. □ 
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